(57) Spread spectrum technology and the related inserted or added information signal can be used for implementing watermarking of digital audio signals. A known processing for retrieving, at receiver or decoder side, the watermark signal information bit from the spread spectrum is convolving the received or replayed spectrum with a spreading function that is time-inverse with respect to the original spreading function. If BPSK modulation was used for applying the spread spectrum function, the output is a peak at the middle of the sequence of correlation values, the sign of such peak representing the value of the desired watermark signal information bit. According to the invention, in order to cope with echo distortions, two or more orthogonal spreading sequences are used at encoder side with the original or encoded audio signal in baseband. When applying the corresponding time-inverse orthogonal spreading sequences at decoder side, echoes that are longer than the length of each of the spreading sequences can be fully removed. The spreading sequences applied can be modified at decoder side according to estimated echo delay values.
Description

[0001] The invention relates to a method and an ap- 
paratus for transmitting watermark data bits using a 
spread spectrum, and to a method and an apparatus for 
regaining watermark data bits embedded in a spread 
spectrum. 

Background 

[0002] 'Watermarking' means imperceptible insertion 
of information into multimedia data, e.g. audio data and/ 
or video data. The insertion of additional information da- 
ta, such as a number or a text, into multimedia data is 
performed through slight modification of the original 
multimedia data. Watermarking can be used for e.g. 
copyright protection, labelling (e.g. URL of a site or a 
site's logo), monitoring, tamper proofing, or conditional 
access. 

Applying 'spread spectrum' in a (RF) communications system means that a small baseband signal bandwidth is intentionally spread over a larger bandwidth by injecting or adding a higher-frequency signal, or spreading function. As a direct consequence, the energy used for transmitting the signal is spread over a wider bandwidth, and appears as noise.

Spread spectrum technology and the related inserted or added information signal can be used for implementing watermarking of e.g. digital audio signals, whereby the spread spectrum can use the complete audio spectrum from 0 Hz to one half of the sampling frequency. This spectrum carries the information of one bit. In a modification of such systems, shorter spread spectrum sequences are used, leading to band-limited spread spectrum signals, so that several of the band-limited spread spectrum signals can be added to the audio spectrum at different centre frequencies, at which centre frequencies the original audio signal has been notch filtered, in order to increase the bitrate of the watermark signals and/or to prevent attacks on the watermarked signals. In this watermark system the spread spectrum signals are modulated on a carrier.
[0003] A known processing for retrieving, at receiver or decoder side, the watermark signal information bit from the spread spectrum is convolving the received or replayed spectrum with a spreading function that is time-inverse with respect to the original spreading function, which kind of processing is also called 'applying a matched filter'. If BPSK modulation was used for applying the spread spectrum function, the output of this process is a peak at the middle of the sequence of correlation values, whereby the sign of such peak represents the value of the desired watermark signal information bit, cf. Fig. 5 which shows a negative peak in the convolution result. If QPSK was used, two peaks will be present in the sequence of correlation values, whereby each peak represents one bit value.
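The matched-filter processing described above can be illustrated with a minimal Python sketch. It is not taken from the specification: the spreading length, the random seed, the noise level and the names `pn` and `matched_filter` are assumptions chosen for illustration, and no psycho-acoustic shaping is applied.

```python
import numpy as np

rng = np.random.default_rng(0)            # arbitrary seed for this sketch
L = 1024                                  # assumed spreading length
pn = rng.choice([-1.0, 1.0], size=L)      # hypothetical spreading sequence

def matched_filter(received, pn):
    """Convolve with the time-inverse sequence; yields 2*L-1 values."""
    return np.convolve(received, pn[::-1]) / len(pn)

bit = -1                                            # BPSK: bit is +1 or -1
received = bit * pn + 0.5 * rng.standard_normal(L)  # sequence plus channel noise

out = matched_filter(received, pn)
peak = int(np.argmax(np.abs(out)))        # position of the strongest peak
decoded_bit = int(np.sign(out[peak]))     # the sign of the peak gives the bit
```

In this sketch the strongest peak lies at index `L - 1`, i.e. in the middle of the `2*L-1` output values, and its sign reproduces the embedded bit.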



Invention 

[0004] This decoding processing works fine in case undisturbed signals are received by the decoder. However, if the received input signals contain multipath or echo or reverberation distortions, the convolved output signals will contain more than one peak per watermark signal information bit (i.e. per convolution result) to be decoded so that, e.g. depending on the amplitude or power of the distortion peaks, it is difficult or in many cases even impossible to retrieve the correct watermark information bits.

[0005] A problem to be solved by the invention is to increase the robustness of spread spectrum systems against echo and reverberation distortions, and to reduce the number of erroneously demodulated watermark signal information bits. This problem is solved by the methods disclosed in claims 1 to 3. Corresponding apparatuses which utilise these methods are disclosed in claims 6 to 8, respectively.

[0006] In a first embodiment of the invention, two or more orthogonal spreading sequences or functions are combined at transmitter or encoder or source side with the original or encoded audio signal in baseband, i.e. without modulating the spreading sequences or functions on a carrier before combining them with the original or encoded audio signal. 'Orthogonal' spreading sequences or functions means that the cross-correlation of such sequences yields a zero-value result, or a very small-value result. When applying the corresponding time-inverse orthogonal spreading sequences or functions at receiver or decoder side, echoes that are longer than the length of each spreading sequence or function can be fully removed.

[0007] In a second embodiment of the invention the time-inverse versions of not necessarily orthogonal spreading sequences or functions are modified at receiver or decoder side according to pre-known or estimated echo delay values and fading parameters. In case of estimated echo delay values the delay time period measurements/calculations can be repeated for several succeeding audio signal frames before a valid delay time period value is formed.
[0008] Advantageously the number of watermark signal bit errors due to echoes caused by multipath or reverberated reception conditions is substantially decreased.

[0009] The features of the first and second embodiment can also be combined in that two or more orthogonal baseband spreading sequences or functions are used which are modified at decoder side according to echo delay values and fading parameters.
[0010] In principle, the inventive method is suited for transmitting watermark data bits using a spread spectrum, said method including the steps:

- Modulating said watermark data bits on an encoder pseudo-noise sequence;
- Transforming said modulated encoder pseudo-noise sequence into the frequency domain and shaping it in amplitude according to the masking level curve of an audio signal together with which the watermark data bit information is to be transmitted or transferred, and transforming said shaped encoder pseudo-noise frequency domain sequence back into the time domain;
- Combining said inverse transformed encoder pseudo-noise frequency domain sequence with a current frame of data of said audio signal;
- Transmitting or transferring said combined audio signal frame or frames carrying said watermark data bits,

wherein the length of said encoder pseudo-noise sequence is one Nth of the length of a frame of said audio signal, N being an integer number greater one, and wherein N orthogonal encoder pseudo-noise sequences are used per frame of said audio signal for carrying out said combining for corresponding sections of a current frame.

[0011] In principle, the inventive method is suited for 
regaining watermark data bits embedded in a spread 
spectrum, whereby the corresponding original water- 
mark data bits were modulated at encoder side on an 
encoder pseudo-noise sequence and said modulated 
encoder pseudo-noise sequence was transformed into 
the frequency domain and shaped in amplitude accord- 
ing to the masking level curve of an audio signal together 
with which the watermark data bit information was trans- 
mitted or transferred, and said shaped encoder pseudo- 
noise frequency domain sequence was transformed 
back into the time domain and was combined with a cur- 
rent frame of data of said audio signal, wherein the 
length of said encoder pseudo-noise sequence was one 
Nth of the length of a frame of said audio signal, N being 
an integer number greater one, wherein N orthogonal 
encoder pseudo-noise sequences were used per frame 
of said audio signal for carrying out said combining for 
corresponding sections of a current frame, said method 
including the steps: 

Receiving and synchronising said transmitted or 
transferred audio signal; 

Convolving each one of a corresponding section of 
said current frame of data of said audio signal with 
the corresponding one of time-inversed versions of 
the N orthogonal encoder pseudo-noise sequenc- 
es; 

Determining, for each one of said sections, from the 
sign of the peak or peaks of the corresponding con- 
volution result the value of a bit of said watermark 
data. 

[0012] In principle, the inventive method is also suited for regaining watermark data bits embedded in a spread spectrum, whereby the corresponding original watermark data bits were modulated at encoder side on an encoder pseudo-noise sequence and said modulated encoder pseudo-noise sequence was transformed into the frequency domain and shaped in amplitude according to the masking level curve of an audio signal together with which the watermark data bit information was transmitted or transferred, and said shaped encoder pseudo-noise frequency domain sequence was transformed back into the time domain and was combined with a current frame of data of said audio signal, wherein the length of said encoder pseudo-noise sequence corresponded to the length of a frame of said audio signal and said encoder pseudo-noise sequence was used for carrying out said combining for a current frame, said method including the steps:

- Receiving and synchronising said transmitted or transferred audio signal;
- Determining in the received audio signal one or more echoes and the related echo delays;
- Constructing a modified decoder pseudo-noise sequence based on the time-inversed version of said encoder pseudo-noise sequence whereby, according to the echo delay or delays determined, correspondingly time-shifted versions of said time-inversed encoder pseudo-noise sequence are combined in order to construct said modified decoder pseudo-noise sequence;
- Convolving said current frame of data of said audio signal with said modified decoder pseudo-noise sequence;
- Determining from the sign of the peak or peaks of the convolution result the value of a bit of said watermark data.

[0013] In principle the inventive apparatus is suited for transmitting watermark data bits using a spread spectrum, said apparatus including:

- Means for modulating said watermark data bits on an encoder pseudo-noise sequence;
- Means for transforming said modulated encoder pseudo-noise sequence into the frequency domain and for shaping it in amplitude according to the masking level curve of an audio signal together with which the watermark data bit information is to be transmitted or transferred, and for transforming said shaped encoder pseudo-noise frequency domain sequence back into the time domain;
- Means for combining said inverse transformed encoder pseudo-noise frequency domain sequence with a current frame of data of said audio signal;
- Means for transmitting or transferring said combined audio signal frame or frames carrying said watermark data bits,

wherein the length of said encoder pseudo-noise sequence is one Nth of the length of a frame of said audio signal, N being an integer number greater one, wherein N orthogonal encoder pseudo-noise sequences are used per frame of said audio signal for carrying out said combining for corresponding sections of a current frame.

[0014] In principle the inventive apparatus is suited for regaining watermark data bits embedded in a spread spectrum, whereby the corresponding original watermark data bits were modulated at encoder side on an encoder pseudo-noise sequence and said modulated encoder pseudo-noise sequence was transformed into the frequency domain and shaped in amplitude according to the masking level curve of an audio signal together with which the watermark data bit information was transmitted or transferred, and said shaped encoder pseudo-noise frequency domain sequence was transformed back into the time domain and was combined with a current frame of data of said audio signal, wherein the length of said encoder pseudo-noise sequence was one Nth of the length of a frame of said audio signal, N being an integer number greater one, wherein N orthogonal encoder pseudo-noise sequences were used per frame of said audio signal for carrying out said combining for corresponding sections of a current frame, said apparatus including:

- Means for receiving and synchronising said transmitted or transferred audio signal;
- Means for convolving each one of a corresponding section of said current frame of data of said audio signal with the corresponding one of time-inversed versions of the N orthogonal encoder pseudo-noise sequences, and for determining, for each one of said sections, from the sign of the peak or peaks of the corresponding convolution result the value of a bit of said watermark data.

[0015] In principle the inventive apparatus is suited for regaining watermark data bits embedded in a spread spectrum, whereby the corresponding original watermark data bits were modulated at encoder side on an encoder pseudo-noise sequence and said modulated encoder pseudo-noise sequence was transformed into the frequency domain and shaped in amplitude according to the masking level curve of an audio signal together with which the watermark data bit information was transmitted or transferred, and said shaped encoder pseudo-noise frequency domain sequence was transformed back into the time domain and was combined with a current frame of data of said audio signal, wherein the length of said encoder pseudo-noise sequence corresponded to the length of a frame of said audio signal and said encoder pseudo-noise sequence was used for carrying out said combining for a current frame, said apparatus including:

- Means for receiving and synchronising said transmitted or transferred audio signal;
- Means for determining in the received audio signal one or more echoes and the related echo delays, and for constructing a modified decoder pseudo-noise sequence based on the time-inversed version of said encoder pseudo-noise sequence whereby, according to the echo delay or delays determined, correspondingly time-shifted versions of said time-inversed encoder pseudo-noise sequence are combined in order to construct said modified decoder pseudo-noise sequence;
- Means for convolving said current frame of data of said audio signal with said modified decoder pseudo-noise sequence, and for determining from the sign of the peak or peaks of the convolution result the value of a bit of said watermark data.

[0016] Advantageous additional embodiments of the invention are disclosed in the respective dependent claims.

Drawings

[0017] Exemplary embodiments of the invention are described with reference to the accompanying drawings, which show in:

Fig. 1 Watermark signal encoder and watermark signal decoder using spread spectrum technique;
Fig. 2 Watermark signal decoder according to the second embodiment;
Fig. 3 Spread spectrum signal in the decoder following time-inverse convolution, showing two successive spreading length portions each containing a (positive) bit signal, the second portion containing also a (negative) echo of the first portion bit signal;
Fig. 4 The bit signals of Fig. 3 wherein the echo bit signal is removed by the inventive features according to the first embodiment;
Fig. 5 Spread spectrum signal in the decoder following time-inverse convolution, showing one spreading length portion containing a (negative) bit signal without echo signal;
Fig. 6 The bit signal of Fig. 5 including an echo bit signal;
Fig. 7 The bit signal of Fig. 6 wherein the echo bit signal is removed by the inventive features according to the second embodiment.

Exemplary embodiments



[0018] In the watermark signal encoder section in Fig. 1 an original audio input signal AUS is encoded, or processed such that the masking level threshold information for an encoding is retrieved, using a psycho-acoustic model calculator PSYMC. The resulting masking level threshold information MLAUD for the audio data frequency spectrum coefficients (resulting e.g. from an FFT or MDCT) of a current audio signal input frame is fed together with related control data or coding parameters CTRLD to a watermark shaping and embedding stage WATSE. Input watermark data IWATD enter a bit value modulation stage BVMOD in which a current bit value of the IWATD data is used to correspondingly modulate a current encoder pseudo-noise sequence section ENCPNSEQ_i. For example, if the current bit value is '+1', the encoder pseudo-noise sequence section ENCPNSEQ_i is left unchanged whereas, if the current bit value is '0' or '-1', the encoder pseudo-noise sequence section ENCPNSEQ_i is inverted. Sequence ENCPNSEQ_i consists of e.g. a 'random' distribution of '0' or '-1' and '+1'. If two different sequences ENCPNSEQ_1 and ENCPNSEQ_2 are used, each of which has a length that is one half of the audio data frame length (of e.g. 4096 samples), two watermark data bits per audio frame can be transmitted. If N different sequences ENCPNSEQ_1 to ENCPNSEQ_N are used, each one of them has a length of 1/N of the audio data frame length, and N watermark data bits per audio frame can be transmitted.

According to the first embodiment of the invention these different sequences ENCPNSEQ_1 to ENCPNSEQ_N are orthogonal. 'Orthogonal' means that any pair of sequences out of the N sequences has a cross-correlation that has an output value of zero, or a very small output value near zero. According to the second embodiment of the invention a single encoder pseudo-noise sequence ENCPNSEQ is used.
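The 'orthogonal' property can be checked numerically. The following Python sketch is an illustration only: it uses two independently drawn random +1/-1 sequences, which are merely nearly orthogonal (small but non-zero cross-correlation); the names `seq_1`, `seq_2` and `normalised_correlation` and the section length of 2048 are assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)            # arbitrary seed for this sketch
L = 2048                                  # assumed section length (4096 / 2)
seq_1 = rng.choice([-1.0, 1.0], size=L)   # stand-in for ENCPNSEQ_1
seq_2 = rng.choice([-1.0, 1.0], size=L)   # stand-in for ENCPNSEQ_2

def normalised_correlation(a, b):
    """Correlation over all 2*L-1 lags, scaled so a perfect match gives 1."""
    return np.correlate(a, b, mode="full") / len(a)

auto_peak = np.max(np.abs(normalised_correlation(seq_1, seq_1)))
cross_peak = np.max(np.abs(normalised_correlation(seq_1, seq_2)))
print(auto_peak, cross_peak)
```

Here `auto_peak` equals 1 while `cross_peak` stays far below it, which is the 'very small output value near zero' case of the definition.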

[0019] The pieces of watermark signals WATS resulting from stage BVMOD are combined with, or added to, corresponding frame sections of spectral audio data in baseband fashion in the watermark shaping and embedding stage WATSE. This is performed in stage WATSE as follows. A current encoder pseudo-noise sequence section ENCPNSEQ_i is transformed into the frequency domain. In the frequency domain, this sequence is 'shaped' according to, i.e. its amplitude envelope is made to conform to, the corresponding frame section masking level shape or curve in masking level threshold information MLAUD. In case there are two encoder pseudo-noise sequence sections per audio frame, sequence ENCPNSEQ_1 is shaped according to the masking level shape or curve in the first half of the audio frame and sequence ENCPNSEQ_2 is shaped according to the masking level shape or curve in the second half of the audio frame.
Following such shaping the encoder pseudo-noise sequence section ENCPNSEQ_i is inversely transformed back into the time domain. The inversely transformed sequence sections ENCPNSEQ_1 to ENCPNSEQ_N are added to or combined with the time domain sample values of the current audio frame.

As an alternative, the encoder pseudo-noise sequence section ENCPNSEQ_i as shaped in the frequency domain can be combined with the frequency domain coefficient values of the current audio frame, whereby an encoded audio signal is transmitted via channel WATAUTRMCH.

The output signal of stage WATSE passes through transmitter stage TRM (which includes e.g. a D/A converter and/or an amplifier) and channel WATAUTRMCH to a watermark signal decoder or receiver.
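The time-domain variant of the WATSE processing can be sketched in Python as follows. This is a simplified illustration, not the specified implementation: the psycho-acoustic model PSYMC is not modelled, `mask` is a hypothetical placeholder for the masking level information MLAUD, and scaling each FFT bin by the mask value is an assumed shaping rule.

```python
import numpy as np

def shape_and_embed(audio_section, pn_section, bit, mask):
    """Embed one bit into one frame section (time-domain combining)."""
    wats = bit * pn_section                        # BVMOD: invert for bit -1
    spectrum = np.fft.rfft(wats)                   # into the frequency domain
    shaped = spectrum * mask                       # assumed rule: scale each bin
    wm_time = np.fft.irfft(shaped, n=len(wats))    # back into the time domain
    return audio_section + wm_time                 # combine with the audio

rng = np.random.default_rng(2)                     # arbitrary seed
n = 2048                                           # assumed section length
audio = rng.standard_normal(n)                     # stand-in for audio samples
pn = rng.choice([-1.0, 1.0], size=n)               # stand-in for ENCPNSEQ_i
mask = 0.01 * np.linspace(1.0, 0.1, n // 2 + 1)    # hypothetical masking curve
watermarked = shape_and_embed(audio, pn, +1, mask)
```

With a flat mask the embedded signal reduces to a scaled copy of the modulated sequence; a frequency-dependent mask, as here, filters it so that its spectrum follows the given curve.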
[0020] Unintentionally, a noise or reverberation or echo signal NRE is added in the watermarked audio transmission channel WATAUTRMCH. This channel can be represented by an acoustic connection between a loudspeaker and a microphone.

[0021] In the first-embodiment watermark signal decoder section in Fig. 1 the distorted transmitted signal enters a receiver stage REC, wherein e.g. a coarse synchronisation and/or an A/D conversion is performed. Its output signal passes through a bit or fine synchronisation stage SYNC to a data recovery matched filter stage DRECMF, or time-inverse convolution stage DRECMF. This stage convolves, or filters, a current incoming audio frame, or a respective section of this frame, with a decoder pseudo-noise sequence DECPNSEQ_i that is pre-known by, or stored in, the decoder and is time-inverse to the related encoder pseudo-noise sequence section ENCPNSEQ_i. In case two orthogonal encoder pseudo-noise sequence sections per audio frame were used in the watermark signal encoder, sequence DECPNSEQ_1 is convolved with the first half of the currently received audio frame and sequence DECPNSEQ_2 is convolved with the second half of the currently received audio frame.
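The effect of using different sequences per frame section can be illustrated with a small Python experiment. It is a sketch under assumptions: random +1/-1 sequences serve as nearly orthogonal stand-ins, the echo delay (1500 samples) and gain (-0.6) are invented, both bits are +1, and no audio signal or shaping is included. Only the echo of the first section inside the second section is inspected; the echo of the second section's own bit is not addressed here.

```python
import numpy as np

rng = np.random.default_rng(3)               # arbitrary seed
SEC = 2048                                   # section length = 4096 / 2
seq_1 = rng.choice([-1.0, 1.0], size=SEC)    # stand-in for ENCPNSEQ_1
seq_2 = rng.choice([-1.0, 1.0], size=SEC)    # stand-in for ENCPNSEQ_2

def second_section_output(enc_1, enc_2, delay=1500, gain=-0.6):
    """Matched-filter output for the second section of a frame with one echo."""
    tx = np.concatenate([enc_1, enc_2])      # frame carrying two +1 bits
    rx = tx.copy()
    rx[delay:] += gain * tx[:-delay]         # single delayed, inverted echo
    section_2 = rx[SEC:]
    return np.convolve(section_2, enc_2[::-1]) / SEC   # 2*SEC-1 = 4095 values

same = second_section_output(seq_1, seq_1)   # one sequence reused in both sections
orth = second_section_output(seq_1, seq_2)   # different sequence per section

echo_pos = 1500 - 1                          # where the echo of section 1 peaks
print(same[echo_pos], orth[echo_pos], orth[SEC - 1])
```

When the same sequence is reused, the echo of the first section produces a clear negative peak at `echo_pos`; with a different second sequence that peak drops to the noise floor while the bit peak in the middle (index `SEC - 1`) remains close to 1.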

[0022] In Fig. 3 corresponding two successive spreading length portions resulting from this time-inverse convolution are depicted, each containing a (positive 1.0 valued peak) bit signal of the transmitted watermark data, the second portion containing also a (negative peak) echo of the first portion bit signal. Basically, a correctly transmitted watermark bit 'appears' as a peak in the middle of the 2*N-1 intermediate correlation results. However, due to echo signals a peak could occur at the same or a different position. The related audio data frame had a length of 4096 samples, therefore the correlation with DECPNSEQ_1 and with DECPNSEQ_2 each provides the results for 4095 correlation steps.
[0023] According to the invention, after the watermark signal decoder receiver part is synchronised, in order to remove a negative echo peak signal in the DECPNSEQ_2 correlation result, either the 'wrong' position or the smaller amplitude > '-1' (or < '+1', respectively) or both facts are used in stage DRECMF so as not to consider such echo peak signals as valid watermark data bits, or to remove such echo peaks, leading to a correlation output signal according to Fig. 4.
[0024] Stage DRECMF provides the watermark signal decoder output watermark data OWATD which, despite the NRE added on the transmission channel, correspond 100% or nearly 100% to the input watermark data IWATD.

[0025] In the second embodiment of the invention a single encoder pseudo-noise sequence ENCPNSEQ is used in the watermark signal encoder and a single correspondingly time-inverse decoder pseudo-noise sequence DECPNSEQ is used in the watermark signal decoder. Apart from that, the watermark signal encoder operates like in the first embodiment.
[0026] The watermark signal decoder stages REC, SYNC and DRECMF operate like in the first embodiment. However, the output signal of receiver stage REC is also fed to an echo detection stage EDET in which echo, multipath or reverberation distortions are detectable and the related delays are calculated. The delay can be estimated using different known methods, e.g. by correlation of the received signal with the unmodified decoder pseudo-noise sequence for one or more frames.

In stage EDET a modified decoder spread spectrum or pseudo-noise sequence MDECPNSEQ is formed by shifting or multiple shifting the position of the original decoder pseudo-noise sequence DECPNSEQ according to the calculated delay or delays, respectively. The output modified decoder spread spectrum sequence MDECPNSEQ is the sum of the original sequence DECPNSEQ and correspondingly delayed (and possibly amplified due to fading) versions of the original sequence, whereby the corresponding cut-off tails of the delayed versions are not considered.
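One possible reading of the construction of MDECPNSEQ is sketched below in Python. The details are assumptions made for illustration: the delayed copy is added before the time inversion, the echo delay (300 samples) and fading gain (-0.8) are treated as already known from EDET, and neither audio content nor noise is included. The helper names `add_echo` and `side_to_centre_ratio` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(4)                # arbitrary seed
L = 4096                                      # sequence length = frame length
enc = rng.choice([-1.0, 1.0], size=L)         # stand-in for ENCPNSEQ
dec = enc[::-1]                               # DECPNSEQ: time-inverse version

def add_echo(x, delay, gain):
    """x plus one delayed copy; the tail shifted past the frame end is cut off."""
    y = x.copy()
    y[delay:] += gain * x[:-delay]
    return y

def side_to_centre_ratio(out):
    """Largest off-centre magnitude divided by the centre-peak magnitude."""
    centre = len(out) // 2
    side = np.abs(out).copy()
    side[centre] = 0.0
    return side.max() / abs(out[centre])

delay, gain = 300, -0.8                       # assumed to be known from EDET
rx = add_echo(-1 * enc, delay, gain)          # received frame carrying bit -1

mdec = add_echo(enc, delay, gain)[::-1]       # MDECPNSEQ as read in this sketch
plain = np.convolve(rx, dec)                  # unmodified matched filter
modified = np.convolve(rx, mdec)              # filter matched to signal plus echo
print(side_to_centre_ratio(plain), side_to_centre_ratio(modified))
```

In this simplified setting the centre peak keeps its negative sign, and the strongest off-centre peak becomes smaller relative to the centre peak when `mdec` is used; it is reduced rather than eliminated.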
[0027] Fig. 5 shows a corresponding convolution 
processing output for a signal received without echo. 
The negative data bit-related peak at position 4096 can 
clearly be seen. 

[0028] Fig. 6 shows a corresponding convolution processing output of the same audio frame but including an echo. The main peak, which has an amplitude that is even greater than that of the correct peak in the middle, is located at the wrong position and has the wrong sign or direction.

[0029] Fig. 7 shows a corresponding convolution 
processing output of the same audio frame, which upon 
receipt included an echo but which was convolved or 
filtered with a correspondingly modified sequence 
MDECPNSEQ. In this case the wrong peak has a sig- 
nificantly reduced amplitude whereas the amplitude of 
the correct data bit peak has the correct amplitude '-1' 
and is therefore correctly identified. 
[0030] In stage DRECMF the delay measurements, 
or the correlation/convolution results, for several (suc- 
ceeding) audio frames are evaluated before a final result 
on the echo delay is formed. 
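Evaluating several frames before fixing the echo delay could look like the following Python sketch. It is an assumption-laden illustration: the delay is estimated per frame from the strongest peak after the centre of the matched-filter output, the final value is simply the most frequent estimate, frames are treated independently (echo leakage between frames is ignored), and the delay, gain and noise level are invented.

```python
import numpy as np
from collections import Counter

rng = np.random.default_rng(5)                # arbitrary seed
L = 1024                                      # assumed frame/sequence length
enc = rng.choice([-1.0, 1.0], size=L)         # stand-in for ENCPNSEQ
dec = enc[::-1]                               # unmodified decoder sequence

def estimate_delay(frame, dec):
    """Offset of the strongest peak after the centre of the filter output."""
    out = np.abs(np.convolve(frame, dec))
    centre = len(dec) - 1
    return int(np.argmax(out[centre + 1:])) + 1

def frame_with_echo(bit, delay, gain, noise_std):
    """One received frame: modulated sequence, one echo, additive noise."""
    tx = bit * enc
    rx = tx.copy()
    rx[delay:] += gain * tx[:-delay]
    return rx + noise_std * rng.standard_normal(L)

true_delay = 200                              # hypothetical echo delay in samples
bits = [1, -1, -1, 1, -1]                     # one watermark bit per frame
estimates = [estimate_delay(frame_with_echo(b, true_delay, 0.7, 0.5), dec)
             for b in bits]
final_delay = Counter(estimates).most_common(1)[0][0]  # most frequent estimate
```

Taking the most frequent value is only one way of combining the per-frame results; averaging the correlation outputs across frames before peak picking would be another.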

[0031] As an alternative, the encoder pseudo-noise sequence section ENCPNSEQ_i or ENCPNSEQ, respectively, as shaped in the frequency domain can be combined with the frequency domain coefficient values of the current audio frame, whereby an encoded audio signal is transmitted via channel WATAUTRMCH and is correspondingly decoded in a watermark signal decoder.

[0032] The pseudo-noise sequences used are calculated by a given algorithm based on a start value. In order to transmit secret watermark data, the start value or even that algorithm can be encrypted and transmitted to the watermark signal decoder wherein it is used to calculate the decoder pseudo-noise sequences DECPNSEQ_i and the modified decoder pseudo-noise sequence MDECPNSEQ.
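Deriving encoder and decoder sequences from a shared start value can be sketched as follows. The generator shown is a hypothetical choice (NumPy's default random generator, which is not a cryptographic generator); the specification does not name the algorithm, and the encryption and transmission of the start value are not shown.

```python
import numpy as np

def pn_sequence(start_value, length):
    """Hypothetical generator: +1/-1 sequence derived from a start value."""
    rng = np.random.default_rng(start_value)
    return rng.choice([-1, 1], size=length)

start_value = 12345                               # shared secret in this sketch
enc_seq = pn_sequence(start_value, 4096)          # encoder side
dec_seq = pn_sequence(start_value, 4096)[::-1]    # decoder side: time-inverse
```

Because both sides run the same algorithm on the same start value, the decoder obtains exactly the time-inverse of the encoder sequence without the sequence itself being transmitted.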

[0033] Instead of audio signals, video signals can be used correspondingly for transmitting watermark data according to the invention.



Claims 

1. Method for transmitting watermark data bits (IWATD) using a spread spectrum, said method including the steps:

- Modulating (BVMOD) said watermark data bits on an encoder pseudo-noise sequence (ENCPNSEQ);
- Transforming (WATSE) said modulated encoder pseudo-noise sequence (WATS) into the frequency domain and shaping it in amplitude according to the masking level curve of an audio signal together with which the watermark data bit information is to be transmitted or transferred, and transforming (WATSE) said shaped encoder pseudo-noise frequency domain sequence back into the time domain;
- Combining (WATSE) said inverse transformed encoder pseudo-noise frequency domain sequence with a current frame of data of said audio signal;
- Transmitting or transferring (TRM) said combined audio signal frame or frames carrying said watermark data bits,

wherein the length of said encoder pseudo-noise sequence (ENCPNSEQ) is one Nth of the length of a frame of said audio signal, N being an integer number greater one,
wherein N orthogonal encoder pseudo-noise sequences (ENCPNSEQ) are used per frame of said audio signal for carrying out said combining for corresponding sections of a current frame.

2. Method for regaining watermark data bits (IWATD) embedded in a spread spectrum, whereby the corresponding original watermark data bits were modulated (BVMOD) at encoder side on an encoder pseudo-noise sequence (ENCPNSEQ) and said modulated encoder pseudo-noise sequence (WATS) was transformed (WATSE) into the frequency domain and shaped in amplitude according to the masking level curve (PSYMC) of an audio signal together with which the watermark data bit information was transmitted or transferred (TRM), and said shaped encoder pseudo-noise frequency domain sequence was transformed (WATSE) back into the time domain and was combined with a current frame of data of said audio signal, wherein the length of said encoder pseudo-noise sequence (ENCPNSEQ) was one Nth of the length of a frame of said audio signal, N being an integer number greater one, wherein N orthogonal encoder pseudo-noise sequences (ENCPNSEQ) were used per frame of said audio signal for carrying out said combining for corresponding sections of a current frame,
said method including the steps:

- Receiving (REC, SYNC) and synchronising said transmitted or transferred audio signal;
- Convolving (DRECMF) each one of a corresponding section of said current frame of data of said audio signal with the corresponding one of time-inversed versions (DECPNSEQ) of the N orthogonal encoder pseudo-noise sequences;
- Determining (DRECMF), for each one of said sections, from the sign of the peak or peaks of the corresponding convolution result the value of a bit of said watermark data (OWATD).

3. Method for regaining watermark data bits (IWATD) embedded in a spread spectrum, whereby the corresponding original watermark data bits were modulated (BVMOD) at encoder side on an encoder pseudo-noise sequence (ENCPNSEQ) and said modulated encoder pseudo-noise sequence (WATS) was transformed (WATSE) into the frequency domain and shaped in amplitude according to the masking level curve (PSYMC) of an audio signal together with which the watermark data bit information was transmitted or transferred (TRM), and said shaped encoder pseudo-noise frequency domain sequence was transformed (WATSE) back into the time domain and was combined with a current frame of data of said audio signal, wherein the length of said encoder pseudo-noise sequence (ENCPNSEQ) corresponded to the length of a frame of said audio signal and said encoder pseudo-noise sequence (ENCPNSEQ) was used for carrying out said combining for a current frame, said method including the steps:

- Receiving (REC, SYNC) and synchronising said transmitted or transferred audio signal;
- Determining (EDET) in the received audio signal one or more echoes and the related echo delays;
- Constructing a modified decoder pseudo-noise sequence (MDECPNSEQ) based on the time-inversed version of said encoder pseudo-noise sequence (ENCPNSEQ) whereby, according to the echo delay or delays determined, correspondingly time-shifted versions of said time-inversed encoder pseudo-noise sequence are combined in order to construct said modified decoder pseudo-noise sequence;
- Convolving (DRECMF) said current frame of data of said audio signal with said modified decoder pseudo-noise sequence (MDECPNSEQ);
- Determining (DRECMF) from the sign of the peak or peaks of the convolution result the value of a bit of said watermark data (OWATD).

4. Method according to claim 3, wherein the length of
said encoder pseudo-noise sequence (ENCPNSEQ)
is one Nth of the length of a frame of said audio
signal, N being an integer number greater than one,
wherein N orthogonal encoder pseudo-noise sequences
(ENCPNSEQ) were used per frame of said audio
signal for carrying out said combining for corresponding
sections of a current frame,

and wherein, for said constructing step, the N time- 
inversed versions of said orthogonal encoder pseu- 
do-noise sequences (ENCPNSEQ) for a current 
frame are assembled together before applying said 
combining, 

and wherein each one of a corresponding section of 
said current frame of data of said audio signal is con- 
volved (DRECMF) with the corresponding section of 
said modified decoder pseudo-noise sequence 
(MDECPNSEQ), 

and wherein, for each one of said sections, from
the sign of the peak or peaks of the correspond- 
ing convolution result the value of a bit of said 
watermark data (OWATD) is determined 
(DRECMF). 
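
The section-wise decoding of claim 4 can be sketched as follows. This is a simplified illustration that assumes no echoes were detected, so the modified decoder sequence reduces to the time-inversed encoder sequences themselves. The function names and the mapping of bit 1 to a positive peak are hypothetical.

```python
def correlation_peak(section, seq):
    """Convolve a section with the time-inversed sequence and
    return the convolution value of largest magnitude."""
    rev = seq[::-1]
    best = 0.0
    for lag in range(len(section) + len(rev) - 1):
        acc = 0.0
        for j, y in enumerate(rev):
            i = lag - j
            if 0 <= i < len(section):
                acc += section[i] * y
        if abs(acc) > abs(best):
            best = acc
    return best


def decode_sections(frame, seqs):
    """Recover one bit per section from the sign of its peak.

    seqs: the N orthogonal encoder sequences, one per frame section.
    """
    n = len(seqs)
    section_len = len(frame) // n
    bits = []
    for k, seq in enumerate(seqs):
        section = frame[k * section_len:(k + 1) * section_len]
        bits.append(1 if correlation_peak(section, seq) > 0 else 0)
    return bits


# Two orthogonal toy sequences (their dot product is zero).
seqs = [[1.0, 1.0, -1.0, -1.0], [1.0, -1.0, -1.0, 1.0]]
# Frame carrying bit 1 in the first section and bit 0 in the second.
frame = seqs[0] + [-x for x in seqs[1]]
print(decode_sections(frame, seqs))
```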

5. Method according to claim 3 or 4 wherein, when de- 
termining (EDET) in the received audio signal one 
or more echoes and the related echo delays, the 
results for several audio frames are evaluated be- 
fore a final result on the echo delay is formed. 
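
Claim 5 does not state how the per-frame results are evaluated. One possible reading, shown here purely as an assumption, is a majority vote over the delays estimated in several frames. The function name and the `min_frames` parameter are hypothetical.

```python
from collections import Counter


def final_echo_delay(per_frame_delays, min_frames=4):
    """Return the most frequent per-frame echo delay estimate,
    or None while too few frames have been evaluated."""
    if len(per_frame_delays) < min_frames:
        return None
    delay, _count = Counter(per_frame_delays).most_common(1)[0]
    return delay


print(final_echo_delay([5, 5, 7, 5, 4]))
```

A vote of this kind keeps a single frame with a spurious estimate from changing the decoder sequence.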

6. Apparatus for transmitting watermark data bits 
(IWATD) using a spread spectrum, said apparatus 
including: 

Means (BVMOD) for modulating said water- 
mark data bits on an encoder pseudo-noise se- 
quence (ENCPNSEQ); 

Means (WATSE) for transforming said modu- 
lated encoder pseudo-noise sequence (WATS) 
into the frequency domain and for shaping it in 
amplitude according to the masking level curve 
of an audio signal together with which the wa- 
termark data bit information is to be transmitted 
or transferred, and for transforming said
shaped encoder pseudo-noise frequency domain
sequence back into the time domain;
Means (WATSE) for combining said inverse 
transformed encoder pseudo-noise frequency 
domain sequence with a current frame of data 
of said audio signal;
Means (TRM) for transmitting or transferring 
said combined audio signal frame or frames 
carrying said watermark data bits, 

wherein the length of said encoder pseudo-noise
sequence (ENCPNSEQ) is one Nth of the length of
a frame of said audio signal, N being an integer
number greater than one,

wherein N orthogonal encoder pseudo-noise
sequences (ENCPNSEQ) are used per frame of said
audio signal for carrying out said combining for
corresponding sections of a current frame.
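
The per-section embedding of claim 6 can be sketched as follows, under loud assumptions: rows of a Walsh-Hadamard matrix stand in for the orthogonal pseudo-noise sequences, a constant `gain` replaces the frequency-domain shaping by the masking level curve, and bit 1 is mapped to +1 and bit 0 to -1. None of these choices is taken from the claims.

```python
def hadamard(order):
    """Sylvester construction of a Hadamard matrix.

    order must be a power of two; the rows are mutually orthogonal.
    """
    rows = [[1]]
    while len(rows) < order:
        rows = ([r + r for r in rows]
                + [r + [-x for x in r] for r in rows])
    return rows


def embed_bits(frame, bits, gain=0.01):
    """Add one bit-modulated orthogonal sequence per frame section.

    The section length (frame length / N) must be a power of two
    larger than N for this toy sequence generator.
    """
    n = len(bits)
    if len(frame) % n != 0:
        raise ValueError("frame length must be a multiple of N")
    section_len = len(frame) // n
    if section_len & (section_len - 1) != 0 or section_len <= n:
        raise ValueError("section length must be a power of two > N")
    seqs = hadamard(section_len)[1:n + 1]  # skip the all-ones row
    out = list(frame)
    for k, (bit, seq) in enumerate(zip(bits, seqs)):
        sign = 1 if bit else -1
        for i, chip in enumerate(seq):
            out[k * section_len + i] += gain * sign * chip
    return out, seqs


audio_frame = [0.0] * 32            # silent toy frame
marked, seqs = embed_bits(audio_frame, [1, 0], gain=1.0)
print(marked[:16] == seqs[0])
```

Because each section carries its own sequence and the sequences are orthogonal, an echo of one section that spills into the next correlates to zero with that next section's sequence when the two are aligned.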

7. Apparatus for regaining watermark data bits
(IWATD) embedded in a spread spectrum, whereby
the corresponding original watermark data bits
were modulated (BVMOD) at encoder side on an
encoder pseudo-noise sequence (ENCPNSEQ)
and said modulated encoder pseudo-noise
sequence (WATS) was transformed (WATSE) into the
frequency domain and shaped in amplitude according
to the masking level curve (PSYMC) of an audio
signal together with which the watermark data bit
information was transmitted or transferred (TRM),
and said shaped encoder pseudo-noise frequency
domain sequence was transformed (WATSE) back
into the time domain and was combined with a
current frame of data of said audio signal, wherein the
length of said encoder pseudo-noise sequence
(ENCPNSEQ) was one Nth of the length of a frame
of said audio signal, N being an integer number
greater than one, wherein N orthogonal encoder
pseudo-noise sequences (ENCPNSEQ) were used per
frame of said audio signal for carrying out said
combining for corresponding sections of a current
frame, said apparatus including:

Means (REC, SYNC) for receiving and
synchronising said transmitted or transferred audio
signal;

Means (DRECMF) for convolving each one of a
corresponding section of said current frame of
data of said audio signal with the corresponding
one of time-inversed versions (DECPNSEQ) of
the N orthogonal encoder pseudo-noise
sequences, and for determining, for each one of
said sections, from the sign of the peak or peaks
of the corresponding convolution result the
value of a bit of said watermark data (OWATD).

8. Apparatus for regaining watermark data bits
(IWATD) embedded in a spread spectrum, whereby
the corresponding original watermark data bits
were modulated (BVMOD) at encoder side on an
encoder pseudo-noise sequence (ENCPNSEQ)
and said modulated encoder pseudo-noise
sequence (WATS) was transformed (WATSE) into the
frequency domain and shaped in amplitude according
to the masking level curve (PSYMC) of an audio
signal together with which the watermark data bit
information was transmitted or transferred (TRM),
and said shaped encoder pseudo-noise frequency
domain sequence was transformed (WATSE) back
into the time domain and was combined with a
current frame of data of said audio signal, wherein the
length of said encoder pseudo-noise sequence
(ENCPNSEQ) corresponded to the length of a
frame of said audio signal and said encoder
pseudo-noise sequence (ENCPNSEQ) was used for
carrying out said combining for a current frame, said
apparatus including:

Means (REC, SYNC) for receiving and
synchronising said transmitted or transferred audio
signal;

Means (EDET) for determining in the received
audio signal one or more echoes and the related
echo delays, and for constructing a modified
decoder pseudo-noise sequence (MDECPNSEQ)
based on the time-inversed version of said
encoder pseudo-noise sequence (ENCPNSEQ)
whereby, according to the echo delay or delays
determined, correspondingly time-shifted
versions of said time-inversed encoder
pseudo-noise sequence are combined in order to
construct said modified decoder pseudo-noise
sequence;

Means (DRECMF) for convolving said current
frame of data of said audio signal with said
modified decoder pseudo-noise sequence
(MDECPNSEQ), and for determining from the
sign of the peak or peaks of the convolution
result the value of a bit of said watermark data
(OWATD).

9. Apparatus according to claim 8, wherein the length of
said encoder pseudo-noise sequence (ENCPNSEQ)
is one Nth of the length of a frame of said audio signal,
N being an integer number greater than one, wherein N
orthogonal encoder pseudo-noise sequences
(ENCPNSEQ) were used per frame of said audio
signal for carrying out said combining for corresponding
sections of a current frame,

and wherein, in said determining means, the N
time-inversed versions of said orthogonal encoder
pseudo-noise sequences (ENCPNSEQ) for a current frame
are assembled together before applying said
combining,

and wherein each one of a corresponding section of
said current frame of data of said audio signal is
convolved in said convolving and determining means
(DRECMF) with the corresponding section of said
modified decoder pseudo-noise sequence
(MDECPNSEQ),

and wherein, for each one of said sections, from
the sign of the peak or peaks of the corresponding
convolution result the value of a bit of said
watermark data (OWATD) is determined in said
convolving and determining means (DRECMF).

10. Apparatus according to claim 8 or 9 wherein, when
determining in said determining means (EDET) one
or more echoes and the related echo delays in the
received audio signal, the results for several audio
frames are evaluated before a final result on the
echo delay is formed.